Visualize Data in a Monitoring System

Learn a novel way to visualize an enormous amount of monitoring data.

Large data centers have millions of servers, and visualizing the health data for all of them is challenging. An important aspect of monitoring a fleet of servers is to know which ones are alive and which ones are offline. A modern data center can house many thousands of servers in a building. We can use a heat map to display information about thousands of servers compactly in a data center.

A heat map is a data visualization technique that shows the magnitude of a phenomenon in two dimensions by using colors.

Using heat maps to troubleshoot#

We’ll identify if a server is down by using heat maps. Each rack of servers is named and is sorted by data center, then cluster, then row, so problems common at any of these levels are readily apparent.

A heat map depicting the operational state of a large number of components is an effective method. The health of each component is indicated by the color of each cell in a big matrix. Nodes with green cells operate within permitted parameters, while nodes with red cells are nonresponsive on multiple tries.

Below, we have a heat map displaying the server’s state.

Cluster 1
Cluster 1
Cluster 2
Cluster 2
Viewer does not support full SVG 1.1
Viewing servers in a data center using heat maps

We can use heat maps for the globally distributed systems and continuously share the health information of a server. We can use one bit (one for live, zero for dead). For 1,000,000 servers, we have 125 KB of data. We can quickly find out which server is down by the red color and focus on the problematic parts.

We can create similar heat maps to get a bird’s-eye view of any resource, like filesystems, networking switches, links, and so on.

Summary#

  • Monitoring systems are critical in distributed systems because they help in analyzing the system and alerting the stakeholders if a problem occurs.

  • We can make a monitoring system scalable using a hybrid of the push and pull methods.

  • Heat maps are a powerful tool for visualization and help us learn about the health of thousands of servers in a compact space.

Detailed Design of a Monitoring System
Focus on Client-side Errors in a Monitoring System
Completed